Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 694409 |
| Missing cells | 1712237 |
| Missing cells (%) | 12.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 106.0 MiB |
| Average record size in memory | 160.0 B |
Variable types
| Categorical | 10 |
|---|---|
| Numeric | 7 |
| Unsupported | 2 |
| Boolean | 1 |
State has constant value "Niger" | Constant |
AFTERNOON has constant value "0" | Constant |
ADR_IDS has constant value "6,1" | Constant |
Regimen has a high cardinality: 90 distinct values | High cardinality |
PHARMACY_ID is highly correlated with PATIENT_ID | High correlation |
PATIENT_ID is highly correlated with PHARMACY_ID and 1 other fields | High correlation |
FACILITY_ID is highly correlated with PATIENT_ID and 2 other fields | High correlation |
EVENING is highly correlated with FACILITY_ID and 1 other fields | High correlation |
ADHERENCE is highly correlated with FACILITY_ID and 1 other fields | High correlation |
PHARMACY_ID is highly correlated with PATIENT_ID and 1 other fields | High correlation |
PATIENT_ID is highly correlated with PHARMACY_ID | High correlation |
FACILITY_ID is highly correlated with ADHERENCE | High correlation |
EVENING is highly correlated with PHARMACY_ID and 1 other fields | High correlation |
ADHERENCE is highly correlated with FACILITY_ID and 1 other fields | High correlation |
PHARMACY_ID is highly correlated with PATIENT_ID | High correlation |
PATIENT_ID is highly correlated with PHARMACY_ID | High correlation |
EVENING is highly correlated with ADHERENCE | High correlation |
ADHERENCE is highly correlated with EVENING | High correlation |
Facility Name is highly correlated with PATIENT_ID and 5 other fields | High correlation |
Regimen Line is highly correlated with Regimen | High correlation |
PATIENT_ID is highly correlated with Facility Name and 5 other fields | High correlation |
L.G.A is highly correlated with Facility Name and 5 other fields | High correlation |
DMOC_TYPE is highly correlated with PHARMACY_ID | High correlation |
Regimen is highly correlated with Facility Name and 5 other fields | High correlation |
PHARMACY_ID is highly correlated with Facility Name and 5 other fields | High correlation |
FACILITY_ID is highly correlated with Facility Name and 2 other fields | High correlation |
ADHERENCE is highly correlated with Facility Name and 4 other fields | High correlation |
State is highly correlated with DMOC_TYPE and 9 other fields | High correlation |
DMOC_TYPE is highly correlated with State and 3 other fields | High correlation |
Regimen is highly correlated with State and 4 other fields | High correlation |
Facility Name is highly correlated with State and 4 other fields | High correlation |
Regimen Line is highly correlated with State and 3 other fields | High correlation |
ADR_SCREENED is highly correlated with State and 2 other fields | High correlation |
AFTERNOON is highly correlated with State and 9 other fields | High correlation |
ADR_IDS is highly correlated with State and 9 other fields | High correlation |
PRESCRIP_ERROR is highly correlated with State and 2 other fields | High correlation |
L.G.A is highly correlated with State and 4 other fields | High correlation |
ADHERENCE is highly correlated with State and 6 other fields | High correlation |
ADR_SCREENED has 502740 (72.4%) missing values | Missing |
ADR_IDS has 694406 (> 99.9%) missing values | Missing |
DMOC_TYPE has 515067 (74.2%) missing values | Missing |
DURATION is highly skewed (γ1 = 65.07331267) | Skewed |
MORNING is highly skewed (γ1 = 234.8112474) | Skewed |
BODY_WEIGHT is highly skewed (γ1 = 20.88914214) | Skewed |
PHARMACY_ID has unique values | Unique |
DATE_VISIT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
NEXT_APPOINTMENT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
MORNING has 561743 (80.9%) zeros | Zeros |
EVENING has 547892 (78.9%) zeros | Zeros |
BODY_WEIGHT has 691674 (99.6%) zeros | Zeros |
Reproduction
| Analysis started | 2021-06-15 09:01:38.570413 |
|---|---|
| Analysis finished | 2021-06-15 09:03:12.291346 |
| Duration | 1 minute and 33.72 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.3 MiB |
| Niger |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3472045 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Niger |
|---|---|
| 2nd row | Niger |
| 3rd row | Niger |
| 4th row | Niger |
| 5th row | Niger |
Common Values
| Value | Count | Frequency (%) |
| Niger | 694409 |
Length
Pie chart
| Value | Count | Frequency (%) |
| niger | 694409 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 694409 | |
| i | 694409 | |
| g | 694409 | |
| e | 694409 | |
| r | 694409 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2777636 | |
| Uppercase Letter | 694409 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 694409 | |
| g | 694409 | |
| e | 694409 | |
| r | 694409 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 694409 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3472045 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 694409 | |
| i | 694409 | |
| g | 694409 | |
| e | 694409 | |
| r | 694409 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3472045 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 694409 | |
| i | 694409 | |
| g | 694409 | |
| e | 694409 | |
| r | 694409 |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.3 MiB |
| Bida | |
|---|---|
| Kontagora | |
| Lapai | |
| Borgu | |
| Rafi | |
| Other values (9) |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.465787454 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3795492 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Magama |
|---|---|
| 2nd row | Magama |
| 3rd row | Magama |
| 4th row | Magama |
| 5th row | Magama |
Common Values
| Value | Count | Frequency (%) |
| Bida | 196055 | |
| Kontagora | 93113 | |
| Lapai | 76673 | 11.0% |
| Borgu | 72502 | 10.4% |
| Rafi | 55176 | 7.9% |
| Mokwa | 49129 | 7.1% |
| Rijau | 43269 | 6.2% |
| Shiroro | 38560 | 5.6% |
| Wushishi | 34446 | 5.0% |
| Magama | 16402 | 2.4% |
| Other values (4) | 19084 | 2.7% |
Length
| Value | Count | Frequency (%) |
| bida | 196055 | |
| kontagora | 93113 | |
| lapai | 76673 | 11.0% |
| borgu | 72502 | 10.4% |
| rafi | 55176 | 7.9% |
| mokwa | 49129 | 7.1% |
| rijau | 43269 | 6.2% |
| shiroro | 38560 | 5.6% |
| wushishi | 34446 | 5.0% |
| magama | 16402 | 2.4% |
| Other values (4) | 19084 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 754031 | |
| i | 481165 | |
| o | 384977 | |
| B | 268557 | 7.1% |
| r | 245275 | 6.5% |
| d | 196055 | 5.2% |
| g | 185970 | 4.9% |
| u | 166761 | 4.4% |
| h | 108865 | 2.9% |
| n | 108244 | 2.9% |
| Other values (17) | 895592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3101083 | |
| Uppercase Letter | 694409 | 18.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 754031 | |
| i | 481165 | |
| o | 384977 | |
| r | 245275 | 7.9% |
| d | 196055 | 6.3% |
| g | 185970 | 6.0% |
| u | 166761 | 5.4% |
| h | 108865 | 3.5% |
| n | 108244 | 3.5% |
| t | 93113 | 3.0% |
| Other values (10) | 376627 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 268557 | |
| R | 98445 | 14.2% |
| K | 93113 | 13.4% |
| L | 89864 | 12.9% |
| M | 71424 | 10.3% |
| S | 38560 | 5.6% |
| W | 34446 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3795492 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 754031 | |
| i | 481165 | |
| o | 384977 | |
| B | 268557 | 7.1% |
| r | 245275 | 6.5% |
| d | 196055 | 5.2% |
| g | 185970 | 4.9% |
| u | 166761 | 4.4% |
| h | 108865 | 2.9% |
| n | 108244 | 2.9% |
| Other values (17) | 895592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3795492 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 754031 | |
| i | 481165 | |
| o | 384977 | |
| B | 268557 | 7.1% |
| r | 245275 | 6.5% |
| d | 196055 | 5.2% |
| g | 185970 | 4.9% |
| u | 166761 | 4.4% |
| h | 108865 | 2.9% |
| n | 108244 | 2.9% |
| Other values (17) | 895592 |
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.3 MiB |
| Federal Medical Centre - Bida | |
|---|---|
| General Hospital Kontagora | |
| General Hospital -Bida | |
| General Hospital - Lapai | |
| General Hospital - New Bussa | |
| Other values (15) |
Length
| Max length | 43 |
|---|---|
| Median length | 26 |
| Mean length | 23.54726105 |
| Min length | 8 |
Characters and Unicode
| Total characters | 16351430 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Rural Hosp- Auna |
|---|---|
| 2nd row | Rural Hosp- Auna |
| 3rd row | Rural Hosp- Auna |
| 4th row | Rural Hosp- Auna |
| 5th row | Rural Hosp- Auna |
Common Values
| Value | Count | Frequency (%) |
| Federal Medical Centre - Bida | 110430 | |
| General Hospital Kontagora | 87583 | |
| General Hospital -Bida | 85625 | |
| General Hospital - Lapai | 71906 | |
| General Hospital - New Bussa | 69864 | |
| General Hospital - Kagara | 55176 | |
| G. Hosp Mokwa | 49129 | |
| General Hospital T Magajiya | 43269 | 6.2% |
| Rural Hosp | 38560 | 5.6% |
| CHC Zungeru | 33081 | 4.8% |
| Other values (10) | 49786 |
Length
| Value | Count | Frequency (%) |
| general | 444115 | |
| hospital | 432864 | |
| 307376 | ||
| bida | 196055 | 7.6% |
| centre | 117373 | 4.5% |
| medical | 115960 | 4.5% |
| federal | 110430 | 4.3% |
| hosp | 103686 | 4.0% |
| kontagora | 95653 | 3.7% |
| lapai | 71906 | 2.8% |
| Other values (30) | 600731 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2158388 | |
| 1901740 | 11.6% | |
| e | 1593001 | 9.7% |
| l | 1159798 | 7.1% |
| i | 937002 | 5.7% |
| r | 918334 | 5.6% |
| o | 793359 | 4.9% |
| n | 696743 | 4.3% |
| s | 696509 | 4.3% |
| t | 673685 | 4.1% |
| Other values (31) | 4822871 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11625314 | |
| Uppercase Letter | 2375275 | 14.5% |
| Space Separator | 1901740 | 11.6% |
| Dash Punctuation | 399972 | 2.4% |
| Other Punctuation | 49129 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2158388 | |
| e | 1593001 | |
| l | 1159798 | |
| i | 937002 | |
| r | 918334 | |
| o | 793359 | 6.8% |
| n | 696743 | 6.0% |
| s | 696509 | 6.0% |
| t | 673685 | 5.8% |
| p | 608456 | 5.2% |
| Other values (11) | 1390039 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 578449 | |
| G | 498011 | |
| B | 271097 | |
| M | 215951 | 9.1% |
| C | 190940 | 8.0% |
| K | 177211 | 7.5% |
| F | 110430 | 4.6% |
| N | 84825 | 3.6% |
| L | 71906 | 3.0% |
| R | 43306 | 1.8% |
| Other values (7) | 133149 | 5.6% |
Space Separator
| Value | Count | Frequency (%) |
| 1901740 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 399972 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 49129 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14000589 | |
| Common | 2350841 | 14.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2158388 | |
| e | 1593001 | |
| l | 1159798 | 8.3% |
| i | 937002 | 6.7% |
| r | 918334 | 6.6% |
| o | 793359 | 5.7% |
| n | 696743 | 5.0% |
| s | 696509 | 5.0% |
| t | 673685 | 4.8% |
| p | 608456 | 4.3% |
| Other values (28) | 3765314 |
Common
| Value | Count | Frequency (%) |
| 1901740 | ||
| - | 399972 | 17.0% |
| . | 49129 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16351430 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2158388 | |
| 1901740 | 11.6% | |
| e | 1593001 | 9.7% |
| l | 1159798 | 7.1% |
| i | 937002 | 5.7% |
| r | 918334 | 5.6% |
| o | 793359 | 4.9% |
| n | 696743 | 4.3% |
| s | 696509 | 4.3% |
| t | 673685 | 4.1% |
| Other values (31) | 4822871 |
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.3 MiB |
| ART First Line Adult | |
|---|---|
| Cotrimoxazole (CTX) Prophylaxis | 47727 |
| Isoniazid Preventive Therapy (IPT) | 18706 |
| ART First Line Children | 12906 |
| ART Second Line Adult | 9624 |
| Other values (8) | 7009 |
Length
| Max length | 46 |
|---|---|
| Median length | 20 |
| Mean length | 21.3161523 |
| Min length | 4 |
Characters and Unicode
| Total characters | 14802128 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ART First Line Adult |
|---|---|
| 2nd row | ART First Line Adult |
| 3rd row | ART First Line Adult |
| 4th row | ART First Line Adult |
| 5th row | ART First Line Adult |
Common Values
| Value | Count | Frequency (%) |
| ART First Line Adult | 598437 | |
| Cotrimoxazole (CTX) Prophylaxis | 47727 | 6.9% |
| Isoniazid Preventive Therapy (IPT) | 18706 | 2.7% |
| ART First Line Children | 12906 | 1.9% |
| ART Second Line Adult | 9624 | 1.4% |
| ARV Prophylaxis for Pregnant Women | 2402 | 0.3% |
| Other anti-infectives (including STI Medicine) | 1926 | 0.3% |
| Other Medicines | 1217 | 0.2% |
| ART Second Line Children | 1089 | 0.2% |
| OI Treatment | 233 | < 0.1% |
| Other values (3) | 142 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| line | 622192 | |
| art | 622056 | |
| first | 611343 | |
| adult | 608066 | |
| prophylaxis | 50129 | 1.8% |
| cotrimoxazole | 47727 | 1.7% |
| ctx | 47727 | 1.7% |
| preventive | 18706 | 0.7% |
| ipt | 18706 | 0.7% |
| isoniazid | 18706 | 0.7% |
| Other values (18) | 65699 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2036648 | ||
| i | 1417556 | 9.6% |
| t | 1295715 | 8.8% |
| A | 1232524 | 8.3% |
| e | 788012 | 5.3% |
| r | 768928 | 5.2% |
| l | 721843 | 4.9% |
| T | 709500 | 4.8% |
| n | 702603 | 4.7% |
| s | 683321 | 4.6% |
| Other values (31) | 4445478 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8516159 | |
| Uppercase Letter | 4110677 | |
| Space Separator | 2036648 | 13.8% |
| Open Punctuation | 68359 | 0.5% |
| Close Punctuation | 68359 | 0.5% |
| Dash Punctuation | 1926 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1417556 | |
| t | 1295715 | |
| e | 788012 | |
| r | 768928 | |
| l | 721843 | |
| n | 702603 | |
| s | 683321 | |
| d | 656685 | |
| u | 609992 | |
| o | 227533 | 2.7% |
| Other values (11) | 643971 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1232524 | |
| T | 709500 | |
| R | 624458 | |
| L | 622192 | |
| F | 611343 | |
| C | 109449 | 2.7% |
| P | 89945 | 2.2% |
| X | 47727 | 1.2% |
| I | 39571 | 1.0% |
| S | 12639 | 0.3% |
| Other values (6) | 11329 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 2036648 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 68359 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 68359 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1926 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12626836 | |
| Common | 2175292 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1417556 | 11.2% |
| t | 1295715 | 10.3% |
| A | 1232524 | 9.8% |
| e | 788012 | 6.2% |
| r | 768928 | 6.1% |
| l | 721843 | 5.7% |
| T | 709500 | 5.6% |
| n | 702603 | 5.6% |
| s | 683321 | 5.4% |
| d | 656685 | 5.2% |
| Other values (27) | 3650149 |
Common
| Value | Count | Frequency (%) |
| 2036648 | ||
| ( | 68359 | 3.1% |
| ) | 68359 | 3.1% |
| - | 1926 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14802128 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2036648 | ||
| i | 1417556 | 9.6% |
| t | 1295715 | 8.8% |
| A | 1232524 | 8.3% |
| e | 788012 | 5.3% |
| r | 768928 | 5.2% |
| l | 721843 | 4.9% |
| T | 709500 | 4.8% |
| n | 702603 | 4.7% |
| s | 683321 | 4.6% |
| Other values (31) | 4445478 |
| Distinct | 90 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.3 MiB |
| TDF(300mg)+3TC(300mg)+DTG(50mg) | |
|---|---|
| AZT(300mg)+3TC(150mg)+ABC(300mg) | |
| TDF(300mg)+3TC(300mg)+LPV/r(200/50mg) | |
| Cotrimoxazole 960mg | |
| TDF(300mg)+3TC(300mg)+EFV(600mg) | |
| Other values (85) |
Length
| Max length | 62 |
|---|---|
| Median length | 32 |
| Mean length | 30.99477109 |
| Min length | 10 |
Characters and Unicode
| Total characters | 21523048 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | TDF(300mg)+3TC(300mg)+DTG(50mg) |
|---|---|
| 2nd row | TDF(300mg)+3TC(300mg)+DTG(50mg) |
| 3rd row | TDF(300mg)+3TC(300mg)+DTG(50mg) |
| 4th row | TDF(300mg)+3TC(300mg)+DTG(50mg) |
| 5th row | TDF(300mg)+3TC(300mg)+DTG(50mg) |
Common Values
| Value | Count | Frequency (%) |
| TDF(300mg)+3TC(300mg)+DTG(50mg) | 206616 | |
| AZT(300mg)+3TC(150mg)+ABC(300mg) | 170496 | |
| TDF(300mg)+3TC(300mg)+LPV/r(200/50mg) | 122259 | |
| Cotrimoxazole 960mg | 46009 | 6.6% |
| TDF(300mg)+3TC(300mg)+EFV(600mg) | 26837 | 3.9% |
| TDF/FTC(300/200mg)+NVP(200mg) | 21423 | 3.1% |
| AZT(300mg)+3TC(150mg)+NVP(200mg) | 19513 | 2.8% |
| Isoniazid 300mg | 17964 | 2.6% |
| TDF/FTC(300/200mg)+EFV(600mg) | 16308 | 2.3% |
| AZT(300mg)+3TC(150mg)+EFV(600mg) | 8994 | 1.3% |
| Other values (80) | 37990 | 5.5% |
Length
| Value | Count | Frequency (%) |
| tdf(300mg)+3tc(300mg)+dtg(50mg | 206616 | |
| azt(300mg)+3tc(150mg)+abc(300mg | 170496 | |
| tdf(300mg)+3tc(300mg)+lpv/r(200/50mg | 122259 | |
| cotrimoxazole | 47809 | 6.3% |
| 960mg | 46009 | 6.0% |
| tdf(300mg)+3tc(300mg)+efv(600mg | 26837 | 3.5% |
| tdf/ftc(300/200mg)+nvp(200mg | 21423 | 2.8% |
| azt(300mg)+3tc(150mg)+nvp(200mg | 19513 | 2.6% |
| isoniazid | 18642 | 2.4% |
| 300mg | 17964 | 2.4% |
| Other values (96) | 65704 | 8.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3520062 | |
| m | 1951692 | |
| g | 1895039 | |
| ( | 1825826 | 8.5% |
| ) | 1825826 | 8.5% |
| 3 | 1764673 | 8.2% |
| T | 1456622 | 6.8% |
| + | 1201249 | 5.6% |
| C | 849591 | 3.9% |
| D | 616749 | 2.9% |
| Other values (46) | 4615719 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6442623 | |
| Uppercase Letter | 5090070 | |
| Lowercase Letter | 4700825 | |
| Open Punctuation | 1825826 | 8.5% |
| Close Punctuation | 1825826 | 8.5% |
| Math Symbol | 1201249 | 5.6% |
| Other Punctuation | 367766 | 1.7% |
| Space Separator | 68863 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 1951692 | |
| g | 1895039 | |
| r | 183812 | 3.9% |
| o | 164731 | 3.5% |
| i | 90638 | 1.9% |
| a | 69908 | 1.5% |
| z | 66614 | 1.4% |
| l | 58804 | 1.3% |
| t | 50275 | 1.1% |
| x | 49795 | 1.1% |
| Other values (12) | 119517 | 2.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1456622 | |
| C | 849591 | |
| D | 616749 | |
| F | 500755 | 9.8% |
| A | 391432 | 7.7% |
| V | 241030 | 4.7% |
| G | 210511 | 4.1% |
| Z | 208073 | 4.1% |
| P | 179810 | 3.5% |
| B | 177920 | 3.5% |
| Other values (8) | 257577 | 5.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3520062 | |
| 3 | 1764673 | |
| 5 | 548098 | 8.5% |
| 1 | 227030 | 3.5% |
| 2 | 220329 | 3.4% |
| 6 | 105280 | 1.6% |
| 9 | 46009 | 0.7% |
| 4 | 9584 | 0.1% |
| 8 | 1461 | < 0.1% |
| 7 | 97 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 367760 | |
| , | 6 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1825826 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1825826 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1201249 |
Space Separator
| Value | Count | Frequency (%) |
| 68863 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11732153 | |
| Latin | 9790895 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 1951692 | |
| g | 1895039 | |
| T | 1456622 | |
| C | 849591 | |
| D | 616749 | 6.3% |
| F | 500755 | 5.1% |
| A | 391432 | 4.0% |
| V | 241030 | 2.5% |
| G | 210511 | 2.2% |
| Z | 208073 | 2.1% |
| Other values (30) | 1469401 |
Common
| Value | Count | Frequency (%) |
| 0 | 3520062 | |
| ( | 1825826 | |
| ) | 1825826 | |
| 3 | 1764673 | |
| + | 1201249 | 10.2% |
| 5 | 548098 | 4.7% |
| / | 367760 | 3.1% |
| 1 | 227030 | 1.9% |
| 2 | 220329 | 1.9% |
| 6 | 105280 | 0.9% |
| Other values (6) | 126020 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21523048 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3520062 | |
| m | 1951692 | |
| g | 1895039 | |
| ( | 1825826 | 8.5% |
| ) | 1825826 | 8.5% |
| 3 | 1764673 | 8.2% |
| T | 1456622 | 6.8% |
| + | 1201249 | 5.6% |
| C | 849591 | 3.9% |
| D | 616749 | 2.9% |
| Other values (46) | 4615719 |
PHARMACY_ID
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIQUE| Distinct | 694409 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1831938.885 |
| Minimum | 30577 |
|---|---|
| Maximum | 4082396 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 MiB |
Quantile statistics
| Minimum | 30577 |
|---|---|
| 5-th percentile | 78221.4 |
| Q1 | 253799 |
| median | 2026173 |
| Q3 | 3248503 |
| 95-th percentile | 3806789.6 |
| Maximum | 4082396 |
| Range | 4051819 |
| Interquartile range (IQR) | 2994704 |
Descriptive statistics
| Standard deviation | 1409940.293 |
|---|---|
| Coefficient of variation (CV) | 0.7696437388 |
| Kurtosis | -1.622628204 |
| Mean | 1831938.885 |
| Median Absolute Deviation (MAD) | 1494246 |
| Skewness | -0.05367781955 |
| Sum | 1.272114849 × 1012 |
| Variance | 1.987931629 × 1012 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 3455056 | 1 | < 0.1% |
| 509357 | 1 | < 0.1% |
| 495012 | 1 | < 0.1% |
| 2590117 | 1 | < 0.1% |
| 2596262 | 1 | < 0.1% |
| 2594215 | 1 | < 0.1% |
| 2384273 | 1 | < 0.1% |
| 2614697 | 1 | < 0.1% |
| 2618795 | 1 | < 0.1% |
| 2608556 | 1 | < 0.1% |
| Other values (694399) | 694399 |
| Value | Count | Frequency (%) |
| 30577 | 1 | |
| 30578 | 1 | |
| 30579 | 1 | |
| 30580 | 1 | |
| 30581 | 1 | |
| 30582 | 1 | |
| 30583 | 1 | |
| 30584 | 1 | |
| 30585 | 1 | |
| 30586 | 1 |
| Value | Count | Frequency (%) |
| 4082396 | 1 | |
| 4082395 | 1 | |
| 4082394 | 1 | |
| 4082393 | 1 | |
| 4082389 | 1 | |
| 4082388 | 1 | |
| 4082387 | 1 | |
| 4082177 | 1 | |
| 4082173 | 1 | |
| 4082170 | 1 |
| Distinct | 19002 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 63993.45849 |
| Minimum | 8217 |
|---|---|
| Maximum | 160854 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 MiB |
Quantile statistics
| Minimum | 8217 |
|---|---|
| 5-th percentile | 9304 |
| Q1 | 13232 |
| median | 19731 |
| Q3 | 111670 |
| 95-th percentile | 144970 |
| Maximum | 160854 |
| Range | 152637 |
| Interquartile range (IQR) | 98438 |
Descriptive statistics
| Standard deviation | 53895.07033 |
|---|---|
| Coefficient of variation (CV) | 0.8421965558 |
| Kurtosis | -1.659011182 |
| Mean | 63993.45849 |
| Median Absolute Deviation (MAD) | 11105 |
| Skewness | 0.2691505114 |
| Sum | 4.443763352 × 1010 |
| Variance | 2904678606 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 143852 | 372 | 0.1% |
| 143759 | 360 | 0.1% |
| 145143 | 344 | < 0.1% |
| 144092 | 321 | < 0.1% |
| 145763 | 320 | < 0.1% |
| 144641 | 314 | < 0.1% |
| 145451 | 304 | < 0.1% |
| 143911 | 293 | < 0.1% |
| 144749 | 290 | < 0.1% |
| 143741 | 285 | < 0.1% |
| Other values (18992) | 691206 |
| Value | Count | Frequency (%) |
| 8217 | 7 | < 0.1% |
| 8218 | 36 | |
| 8219 | 25 | |
| 8220 | 23 | |
| 8221 | 42 | |
| 8222 | 15 | < 0.1% |
| 8223 | 42 | |
| 8224 | 7 | < 0.1% |
| 8225 | 20 | |
| 8226 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 160854 | 3 | |
| 160849 | 4 | |
| 160837 | 5 | |
| 160836 | 5 | |
| 160835 | 4 | |
| 160833 | 3 | |
| 160759 | 5 | |
| 160717 | 5 | |
| 160715 | 4 | |
| 160714 | 5 |
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9052.747443 |
| Minimum | 3005 |
|---|---|
| Maximum | 10026 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 MiB |
Quantile statistics
| Minimum | 3005 |
|---|---|
| 5-th percentile | 3005 |
| Q1 | 10013 |
| median | 10017 |
| Q3 | 10023 |
| 95-th percentile | 10025 |
| Maximum | 10026 |
| Range | 7021 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 2417.144705 |
|---|---|
| Coefficient of variation (CV) | 0.2670067535 |
| Kurtosis | 2.41941982 |
| Mean | 9052.747443 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -2.102234411 |
| Sum | 6286309299 |
| Variance | 5842588.523 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10023 | 110430 | |
| 3005 | 87583 | |
| 10013 | 85625 | |
| 10016 | 71906 | |
| 10022 | 69864 | |
| 10014 | 55176 | |
| 10025 | 49129 | |
| 10017 | 43269 | 6.2% |
| 10024 | 38560 | 5.6% |
| 10018 | 33081 | 4.8% |
| Other values (10) | 49786 |
| Value | Count | Frequency (%) |
| 3005 | 87583 | |
| 3007 | 2540 | 0.4% |
| 3008 | 5530 | 0.8% |
| 10010 | 1365 | 0.2% |
| 10011 | 1441 | 0.2% |
| 10012 | 4767 | 0.7% |
| 10013 | 85625 | |
| 10014 | 55176 | |
| 10015 | 14961 | 2.2% |
| 10016 | 71906 |
| Value | Count | Frequency (%) |
| 10026 | 1413 | 0.2% |
| 10025 | 49129 | |
| 10024 | 38560 | 5.6% |
| 10023 | 110430 | |
| 10022 | 69864 | |
| 10021 | 2638 | 0.4% |
| 10020 | 1940 | 0.3% |
| 10019 | 13191 | 1.9% |
| 10018 | 33081 | 4.8% |
| 10017 | 43269 | 6.2% |
| Distinct | 93 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 78.33866209 |
| Minimum | 0 |
|---|---|
| Maximum | 18090 |
| Zeros | 101 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 30 |
| Q1 | 60 |
| median | 60 |
| Q3 | 90 |
| 95-th percentile | 180 |
| Maximum | 18090 |
| Range | 18090 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 51.11679274 |
|---|---|
| Coefficient of variation (CV) | 0.6525104128 |
| Kurtosis | 22228.308 |
| Mean | 78.33866209 |
| Median Absolute Deviation (MAD) | 30 |
| Skewness | 65.07331267 |
| Sum | 54399072 |
| Variance | 2612.9265 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60 | 322857 | |
| 90 | 157517 | |
| 180 | 88938 | 12.8% |
| 30 | 82785 | 11.9% |
| 14 | 18881 | 2.7% |
| 120 | 10904 | 1.6% |
| 49 | 3828 | 0.6% |
| 15 | 3758 | 0.5% |
| 168 | 1048 | 0.2% |
| 7 | 1016 | 0.1% |
| Other values (83) | 2877 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 101 | < 0.1% |
| 1 | 8 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 7 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 3 | < 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 1016 | |
| 9 | 6 | < 0.1% |
| 10 | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| 18090 | 1 | < 0.1% |
| 1820 | 1 | < 0.1% |
| 1800 | 8 | < 0.1% |
| 1210 | 1 | < 0.1% |
| 1168 | 1 | < 0.1% |
| 960 | 89 | |
| 900 | 1 | < 0.1% |
| 810 | 1 | < 0.1% |
| 720 | 1 | < 0.1% |
| 600 | 12 | < 0.1% |
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.219905416 |
| Minimum | 0 |
|---|---|
| Maximum | 960 |
| Zeros | 561743 |
| Zeros (%) | 80.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 960 |
| Range | 960 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.995967749 |
|---|---|
| Coefficient of variation (CV) | 9.076482907 |
| Kurtosis | 94636.22191 |
| Mean | 0.219905416 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 234.8112474 |
| Sum | 152704.3 |
| Variance | 3.983887256 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 561743 | |
| 1 | 128884 | 18.6% |
| 2 | 1861 | 0.3% |
| 3 | 1771 | 0.3% |
| 90 | 127 | < 0.1% |
| 180 | 7 | < 0.1% |
| 120 | 3 | < 0.1% |
| 0.1 | 3 | < 0.1% |
| 15 | 2 | < 0.1% |
| 1.06 | 1 | < 0.1% |
| Other values (7) | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 561743 | |
| 0.1 | 3 | < 0.1% |
| 1 | 128884 | 18.6% |
| 1.05 | 1 | < 0.1% |
| 1.06 | 1 | < 0.1% |
| 1.09 | 1 | < 0.1% |
| 1.8 | 1 | < 0.1% |
| 2 | 1861 | 0.3% |
| 3 | 1771 | 0.3% |
| 15 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 960 | 1 | < 0.1% |
| 650 | 1 | < 0.1% |
| 180 | 7 | < 0.1% |
| 120 | 3 | < 0.1% |
| 90 | 127 | < 0.1% |
| 60 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 3 | 1771 | |
| 2 | 1861 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.3 MiB |
| 0 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 694409 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 694409 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 694409 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 694409 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 694409 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 694409 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 694409 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 694409 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 694409 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 694409 |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2173344528 |
| Minimum | 0 |
|---|---|
| Maximum | 90 |
| Zeros | 547892 |
| Zeros (%) | 78.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4488841288 |
|---|---|
| Coefficient of variation (CV) | 2.065407132 |
| Kurtosis | 2363.535512 |
| Mean | 0.2173344528 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 14.04583624 |
| Sum | 150919 |
| Variance | 0.201496961 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 547892 | |
| 1 | 144030 | 20.7% |
| 3 | 1771 | 0.3% |
| 2 | 713 | 0.1% |
| 30 | 2 | < 0.1% |
| 90 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 547892 | |
| 1 | 144030 | 20.7% |
| 2 | 713 | 0.1% |
| 3 | 1771 | 0.3% |
| 30 | 2 | < 0.1% |
| 90 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 90 | 1 | < 0.1% |
| 30 | 2 | < 0.1% |
| 3 | 1771 | 0.3% |
| 2 | 713 | 0.1% |
| 1 | 144030 | 20.7% |
| 0 | 547892 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 502740 |
| Missing (%) | 72.4% |
| Memory size | 1.3 MiB |
| False | |
|---|---|
| True | 183 |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 191486 | 27.6% |
| True | 183 | < 0.1% |
| (Missing) | 502740 |
| Distinct | 1 |
|---|---|
| Distinct (%) | 33.3% |
| Missing | 694406 |
| Missing (%) | > 99.9% |
| Memory size | 5.3 MiB |
| 6,1 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 9 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 6,1 |
|---|---|
| 2nd row | 6,1 |
| 3rd row | 6,1 |
Common Values
| Value | Count | Frequency (%) |
| 6,1 | 3 | < 0.1% |
| (Missing) | 694406 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 6,1 | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 3 | |
| , | 3 | |
| 1 | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Other Punctuation | 3 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 3 | |
| 1 | 3 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 3 | |
| , | 3 | |
| 1 | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 3 | |
| , | 3 | |
| 1 | 3 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.3 MiB |
| 0 | |
|---|---|
| 1 | 703 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 694409 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 693706 | |
| 1 | 703 | 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 693706 | |
| 1 | 703 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 693706 | |
| 1 | 703 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 694409 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 693706 | |
| 1 | 703 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 694409 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 693706 | |
| 1 | 703 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 694409 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 693706 | |
| 1 | 703 | 0.1% |
ADHERENCE
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.3 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 694409 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 624630 | |
| 1 | 69779 | 10.0% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 624630 | |
| 1 | 69779 | 10.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 624630 | |
| 1 | 69779 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 694409 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 624630 | |
| 1 | 69779 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 694409 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 624630 | |
| 1 | 69779 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 694409 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 624630 | |
| 1 | 69779 | 10.0% |
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 515067 |
| Missing (%) | 74.2% |
| Memory size | 5.3 MiB |
| Same Facility Refill | |
|---|---|
| MMD | |
| Individual delivery/home-based | |
| MMS | 2906 |
| Different Facility Refill (Private hospital/clinic) | 167 |
| Other values (8) | 98 |
Length
| Max length | 51 |
|---|---|
| Median length | 20 |
| Mean length | 14.15255768 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2538148 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MMD |
|---|---|
| 2nd row | MMD |
| 3rd row | MMD |
| 4th row | MMD |
| 5th row | MMD |
Common Values
| Value | Count | Frequency (%) |
| Same Facility Refill | 95453 | 13.7% |
| MMD | 67085 | 9.7% |
| Individual delivery/home-based | 13633 | 2.0% |
| MMS | 2906 | 0.4% |
| Different Facility Refill (Private hospital/clinic) | 167 | < 0.1% |
| Other | 42 | < 0.1% |
| Fixed or ad hoc pick up points | 27 | < 0.1% |
| Mobile van/other vehicle | 10 | < 0.1% |
| PMVs/Chemists | 4 | < 0.1% |
| CPARP | 4 | < 0.1% |
| Other values (3) | 11 | < 0.1% |
| (Missing) | 515067 |
Length
| Value | Count | Frequency (%) |
| facility | 95620 | |
| refill | 95620 | |
| same | 95453 | |
| mmd | 67085 | |
| individual | 13633 | 3.5% |
| delivery/home-based | 13633 | 3.5% |
| mms | 2906 | 0.8% |
| private | 167 | < 0.1% |
| different | 167 | < 0.1% |
| hospital/clinic | 167 | < 0.1% |
| Other values (21) | 300 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 328711 | |
| l | 314488 | |
| e | 246239 | 9.7% |
| a | 218710 | 8.6% |
| 205409 | 8.1% | |
| M | 139996 | 5.5% |
| y | 109257 | 4.3% |
| m | 109090 | 4.3% |
| S | 98359 | 3.9% |
| t | 96212 | 3.8% |
| Other values (30) | 671677 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1794110 | |
| Uppercase Letter | 510820 | 20.1% |
| Space Separator | 205409 | 8.1% |
| Other Punctuation | 13830 | 0.5% |
| Dash Punctuation | 13637 | 0.5% |
| Open Punctuation | 171 | < 0.1% |
| Close Punctuation | 171 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 328711 | |
| l | 314488 | |
| e | 246239 | |
| a | 218710 | |
| y | 109257 | 6.1% |
| m | 109090 | 6.1% |
| t | 96212 | 5.4% |
| c | 96022 | 5.4% |
| f | 95954 | 5.3% |
| d | 54594 | 3.0% |
| Other values (11) | 124833 | 7.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 139996 | |
| S | 98359 | |
| F | 95647 | |
| R | 95631 | |
| D | 67260 | |
| I | 13633 | 2.7% |
| P | 187 | < 0.1% |
| O | 42 | < 0.1% |
| C | 30 | < 0.1% |
| A | 15 | < 0.1% |
| Other values (3) | 20 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 13822 | |
| , | 8 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 205409 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13637 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 171 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 171 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2304930 | |
| Common | 233218 | 9.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 328711 | |
| l | 314488 | |
| e | 246239 | |
| a | 218710 | 9.5% |
| M | 139996 | 6.1% |
| y | 109257 | 4.7% |
| m | 109090 | 4.7% |
| S | 98359 | 4.3% |
| t | 96212 | 4.2% |
| c | 96022 | 4.2% |
| Other values (24) | 547846 |
Common
| Value | Count | Frequency (%) |
| 205409 | ||
| / | 13822 | 5.9% |
| - | 13637 | 5.8% |
| ( | 171 | 0.1% |
| ) | 171 | 0.1% |
| , | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2538148 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 328711 | |
| l | 314488 | |
| e | 246239 | 9.7% |
| a | 218710 | 8.6% |
| 205409 | 8.1% | |
| M | 139996 | 5.5% |
| y | 109257 | 4.3% |
| m | 109090 | 4.3% |
| S | 98359 | 3.9% |
| t | 96212 | 3.8% |
| Other values (30) | 671677 |
| Distinct | 66 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.07961143937 |
| Minimum | 0 |
|---|---|
| Maximum | 67 |
| Zeros | 691674 |
| Zeros (%) | 99.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 67 |
| Range | 67 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.394571974 |
|---|---|
| Coefficient of variation (CV) | 17.51723101 |
| Kurtosis | 517.9834294 |
| Mean | 0.07961143937 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 20.88914214 |
| Sum | 55282.9 |
| Variance | 1.944830991 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 691674 | |
| 30 | 187 | < 0.1% |
| 15 | 161 | < 0.1% |
| 25 | 161 | < 0.1% |
| 20 | 154 | < 0.1% |
| 18 | 123 | < 0.1% |
| 10 | 117 | < 0.1% |
| 12 | 115 | < 0.1% |
| 22 | 111 | < 0.1% |
| 8 | 104 | < 0.1% |
| Other values (56) | 1502 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 691674 | |
| 1 | 4 | < 0.1% |
| 3 | 11 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 18 | < 0.1% |
| 6 | 19 | < 0.1% |
| 6.8 | 4 | < 0.1% |
| 7 | 27 | < 0.1% |
| 7.5 | 4 | < 0.1% |
| 8 | 104 | < 0.1% |
| Value | Count | Frequency (%) |
| 67 | 4 | < 0.1% |
| 66 | 4 | < 0.1% |
| 65 | 4 | < 0.1% |
| 57 | 4 | < 0.1% |
| 56 | 6 | < 0.1% |
| 50 | 18 | |
| 49 | 11 | |
| 48 | 12 | |
| 46 | 8 | |
| 44 | 2 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| State | L.G.A | Facility Name | Regimen Line | Regimen | PHARMACY_ID | PATIENT_ID | FACILITY_ID | DATE_VISIT | DURATION | MORNING | AFTERNOON | EVENING | ADR_SCREENED | ADR_IDS | PRESCRIP_ERROR | ADHERENCE | NEXT_APPOINTMENT | DMOC_TYPE | BODY_WEIGHT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Niger | Magama | Rural Hosp- Auna | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 30577 | 8232 | 10011 | 2020-03-16 00:00:00 | 90 | 0.0 | 0 | 1 | NaN | NaN | 0 | 0 | 2020-07-16 00:00:00 | MMD | 0.0 |
| 1 | Niger | Magama | Rural Hosp- Auna | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 30578 | 8219 | 10011 | 2020-03-02 00:00:00 | 30 | 1.0 | 0 | 0 | NaN | NaN | 0 | 0 | 2020-04-02 00:00:00 | NaN | 0.0 |
| 2 | Niger | Magama | Rural Hosp- Auna | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 30579 | 8246 | 10011 | 2020-02-26 00:00:00 | 90 | 0.0 | 0 | 1 | NaN | NaN | 0 | 0 | 2020-05-17 00:00:00 | MMD | 0.0 |
| 3 | Niger | Magama | Rural Hosp- Auna | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 30580 | 8217 | 10011 | 2020-06-06 00:00:00 | 90 | 0.0 | 0 | 1 | NaN | NaN | 0 | 0 | 2020-09-04 00:00:00 | MMD | 0.0 |
| 4 | Niger | Magama | Rural Hosp- Auna | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 30581 | 8243 | 10011 | 2020-04-18 00:00:00 | 90 | 0.0 | 0 | 1 | NaN | NaN | 0 | 0 | 2020-07-16 00:00:00 | MMD | 0.0 |
| 5 | Niger | Magama | Rural Hosp- Auna | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 30582 | 8256 | 10011 | 2020-03-02 00:00:00 | 30 | 0.0 | 0 | 1 | NaN | NaN | 0 | 0 | 2020-04-02 00:00:00 | NaN | 0.0 |
| 6 | Niger | Magama | Rural Hosp- Auna | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 30583 | 8246 | 10011 | 2020-02-11 00:00:00 | 90 | 0.0 | 0 | 0 | NaN | NaN | 0 | 0 | 2020-05-11 00:00:00 | NaN | 0.0 |
| 7 | Niger | Magama | Rural Hosp- Auna | ART First Line Adult | TDF(300mg)+3TC(300mg)+LPV/r(200/50mg) | 30584 | 8240 | 10011 | 2017-05-27 00:00:00 | 30 | 0.0 | 0 | 0 | NaN | NaN | 0 | 0 | 2017-06-26 00:00:00 | NaN | 0.0 |
| 8 | Niger | Magama | Rural Hosp- Auna | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 30585 | 8230 | 10011 | 2020-05-08 00:00:00 | 90 | 0.0 | 0 | 1 | NaN | NaN | 0 | 0 | 2020-08-05 00:00:00 | MMD | 0.0 |
| 9 | Niger | Magama | Rural Hosp- Auna | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 30586 | 8223 | 10011 | 2020-02-26 00:00:00 | 30 | 0.0 | 0 | 1 | NaN | NaN | 0 | 0 | 2020-03-26 00:00:00 | NaN | 0.0 |
Last rows
| State | L.G.A | Facility Name | Regimen Line | Regimen | PHARMACY_ID | PATIENT_ID | FACILITY_ID | DATE_VISIT | DURATION | MORNING | AFTERNOON | EVENING | ADR_SCREENED | ADR_IDS | PRESCRIP_ERROR | ADHERENCE | NEXT_APPOINTMENT | DMOC_TYPE | BODY_WEIGHT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 694399 | Niger | Kontagora | General Hospital Kontagora | Cotrimoxazole (CTX) Prophylaxis | Cotrimoxazole 960mg | 4082170 | 160836 | 3005 | 2021-05-28 00:00:00 | 90 | 1.0 | 0 | 0 | No | NaN | 0 | 0 | 2021-08-25 00:00:00 | Same Facility Refill | 0.0 |
| 694400 | Niger | Kontagora | General Hospital Kontagora | Isoniazid Preventive Therapy (IPT) | Isoniazid 300mg | 4082173 | 144877 | 3005 | 2020-08-28 00:00:00 | 15 | 1.0 | 0 | 0 | No | NaN | 0 | 0 | 2020-09-10 00:00:00 | Same Facility Refill | 0.0 |
| 694401 | Niger | Kontagora | General Hospital Kontagora | ART Second Line Adult | TDF(300mg)+3TC(150mg)+ATV/r(300/100mg) | 4082177 | 145221 | 3005 | 2021-05-27 00:00:00 | 90 | 1.0 | 0 | 0 | No | NaN | 0 | 0 | 2021-09-06 00:00:00 | Same Facility Refill | 0.0 |
| 694402 | Niger | Borgu | Wawa BHC | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 4082387 | 160854 | 10021 | 2021-05-30 00:00:00 | 180 | 0.0 | 0 | 1 | No | NaN | 0 | 0 | 2021-10-30 00:00:00 | Same Facility Refill | 0.0 |
| 694403 | Niger | Borgu | Wawa BHC | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 4082388 | 160854 | 10021 | 2021-05-30 00:00:00 | 180 | 0.0 | 0 | 1 | No | NaN | 0 | 0 | 2021-10-30 00:00:00 | Same Facility Refill | 0.0 |
| 694404 | Niger | Borgu | Wawa BHC | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 4082389 | 160854 | 10021 | 2021-05-30 00:00:00 | 180 | 1.0 | 0 | 0 | No | NaN | 0 | 0 | 2021-10-30 00:00:00 | Same Facility Refill | 0.0 |
| 694405 | Niger | Lapai | General Hospital - Lapai | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 4082393 | 131562 | 10016 | 2021-05-06 00:00:00 | 180 | 1.0 | 0 | 0 | No | NaN | 0 | 0 | 2021-10-21 00:00:00 | Same Facility Refill | 0.0 |
| 694406 | Niger | Lapai | General Hospital - Lapai | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 4082394 | 131562 | 10016 | 2021-05-06 00:00:00 | 180 | 0.0 | 0 | 1 | No | NaN | 0 | 0 | 2021-10-21 00:00:00 | Same Facility Refill | 0.0 |
| 694407 | Niger | Lapai | General Hospital - Lapai | Cotrimoxazole (CTX) Prophylaxis | Cotrimoxazole 960mg | 4082395 | 131562 | 10016 | 2021-05-06 00:00:00 | 180 | 1.0 | 0 | 0 | No | NaN | 0 | 0 | 2021-10-21 00:00:00 | Same Facility Refill | 0.0 |
| 694408 | Niger | Lapai | General Hospital - Lapai | ART First Line Adult | TDF(300mg)+3TC(300mg)+DTG(50mg) | 4082396 | 131562 | 10016 | 2021-05-06 00:00:00 | 180 | 0.0 | 0 | 1 | No | NaN | 0 | 0 | 2021-10-21 00:00:00 | Same Facility Refill | 0.0 |